The UPC text-to-speech system for Spanish and Catalan
Abstract
This paper summarizes the text-to-speech system developed by the Speech Group of the Universitat Politècnica de Catalunya (UPC). The system consists of a core and several interfaces, so that it can be used for research, in telephone applications (with either CTI boards or standard ISDN PC cards supporting CAPI), and in Windows applications developed using Microsoft SAPI. The paper reviews the system with emphasis on the language-dependent parts, which allow it to read bilingual text (Spanish and Catalan). The paper also presents new approaches to prosodic modeling (segmental duration modeling) and to generating the database of speech segments, both of which were introduced last year.
Related papers
Development of Language Resources for Speech-to-speech Translation
This paper describes the creation of linguistically enriched aligned corpora for Catalan, Spanish and US-English for Speech-to-Speech Translation. These corpora are obtained from two different sources: US-English transcribed speech data and transcriptions of conversations recorded in Catalan and Spanish. After human translation, a large trilingual spontaneous speech corpus has been obtained. Thi...
Monolingual and Bilingual Spanish-Catalan Speech Recognizers Developed from SpeechDat Databases
Under the SpeechDat specifications, the Spanish member of the SpeechDat consortium has recorded a Catalan database that includes one thousand speakers. This communication describes some experimental work that has been carried out using both the Spanish and the Catalan speech material. A speech recognition system has been trained for the Spanish language using a selection of the phonetically balance...
A Bilingual Spanish-Catalan Database of Units for Concatenative Synthesis
Different databases of phonetic units are required in multilingual Text-to-Speech systems based on concatenative synthesis. We are currently developing a TTS system able to convert text in either Catalan or Spanish, with some of the modules shared by both languages while others are specific to each language. In order to reduce the total amount of units, a bilingual database...
Lenition of /d/ in spontaneous Spanish and Catalan
The present study explores the acoustics of /d/ in two corpora of Spanish and Catalan spontaneous speech. Three acoustic metrics were developed as indexes of articulatory weakening. The findings suggest that variations in the implementation of /d/ result from gradient modulations in constriction degree on a unimodal statistical-acoustic distribution. The preceding segment is a strong predictor ...
Bilingual aligned corpora for speech to speech translation for Spanish, English and Catalan
In the framework of the EU-funded Project LC-STAR, a set of Language Resources (LR) for all the Speech to Speech Translation components (Speech recognition, Machine Translation and Speech Synthesis) was developed. This paper deals with the development of bilingual corpora in Spanish, US English and Catalan. The corpora were obtained from spontaneous dialogues in one of these three languages whi...